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(57) Attract 

A computer comprises a plurality of processor modules 
(1 lOn-l H>nX each having at least fir* and second I/O connection 
interfaces (126, 130A, 130R, 130C, I30D), a processor (122) con- 
nected to those interfaces (126, L30A, 130B, 130C, 130D), and 
read-write memory (1 18) connected to the processor. The first I/O 
(126) of each processor module <I IQo-UQn) *» 
bus (138). The second I/O connection in- 
terface (130A, 13GB, 130C. 130D) of each processor module 
(1 KV1 10n)b connected to a twitch (150A, 150B, 150C, I50DX the 
switch (150A, 150B, 150C, 130D) bein& operative to connect the 
second I/O connection interface (130 A, 130B, 130C, l3oD) of a se- 
lected p ro ce ssor module (UOo-UOh) selectively to the second I/O 
connection interface (130A, 130B, 130C, 130D) of any other pro- 
cessor module (MOo-llOjs,). A controller (142) is connected to the 
bus (138) for receiving over the bus (138) data pertaining to A first 
processor module (IIOq-HOn), which requires access to informa- 
tion, and to a second processor module (1 IOq-1 IOn), from Which 
the required information is available, and for controlling the switch 
(150 A, I50B, 150C, 150D) to allow the required information to be 
transmitted from the first processor module (HOo-HOn) to the sec* 
(HOo-llO*). 
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COMPUTING MACHINE WITH 
HYBRID COMMUNICATION ARCHITECTURE 

Background of the Invention 
This invention relates to a computing machine 
5 with hybrid communication architecture. 

A digital computer solves a problem by break- 
ing the problem down into multiple steps, a compu- 
ter with a single processor is able to execute one 
step at a time. It takes an inordinate time to 
10 solve a complex problem by use of such a sequential 
mode of operation. By operating multiple proces- 
sors in parallel/ it is generally possible to 
eeduce substantially the time required to solve a 
problem* 

X5 If multiple processors are operated in 

parallel, it is necessary for the processors to 
share data, one technique for sharing data among 
multiple processors is for there to be a common, or 
global, memory to which all the processors have 

20 access on an equal footing. A problem with this 
technique arises from the fact that only one pro- 
cessor can access the memory at a given time, and 
therefore contention -problems limit the number of 
processors that can be accommodated* The number of 

25 processors Can be increased somewhat by use of 

coherent caching or crossbar switching, but such 
techniques are costly and cumbersome. 

A second method of allowing sharing of data by 
multiple processors involves use of a parallel 

30 communication bus. A bus allows a great deal of 

flexibility in communication, including the ability 
to broadcast data from one processor to many or all 
of the others in a single operation. Each proces- 
sor is able to execute independently until the need 

35 
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co communicate arises. However, when communication 
is necessary, contention problems arise since only 
one processor can transmit on the bus at a time* 
Use of multiple buses can reduce problems due to 
5 contention, but multiple buses also reduce flexi- 
bility and add greatly to cost and complexity. 

A third technique for allowing multiple pro- 
cessors to share data is provided by point-to-point 
communication links, processors that have built-in 

10 links are commercially available, and therefore 
they are very easy to provide. Links offer 
virtually unlimited expansion possibilities, since 
che number of communication paths increases 
whenever a processor is added. However, links are 

15 the most difficult to use, since the physical 

interconnection pattern must match the pattern of 
communication required by the program that is being 
. executed. if processors are added to or removed 
from the system, a new pattern of link connections 

20- must be established, and the program must be 
rewritten, recompiled or, at the very least, 
relinked to match. Broadcasting a message is 
difficult and time consuming, sinee the message 
must be copied from one processor to the next until 

25 it has reached all processors. Since two different 
programs will generally require two different 
physical interconnection patterns, it has not 
hitherto been possible to execute multiple programs 
simultaneously using links for communication 

30 between processors unless the programs are 
specifically designed- to require the same 
communication patterns. 

The difficulty of matching the physical 
interconnection pattern with the. pattern of commu- 
35 nication required by the program is partially 
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The read operation is identical to in except 
that the matching tuple is not removed from the 
tuple space. The eval operation is a specialized 
form of out . Out creates a passive tuple, whereas 
5 eval creates an active tuple. 

If a processor performs an in operation, it is 
necessary to search tuple apace for the matching 
tuple. It would be time consuming to examine each 
tuple in turn to determine whether it matches the 
10 template, and therefore the tuples are classified 
and a directory is created to facilitate the 
search, in a multi-processor computer, different 
portions of the directory are accessed by the 
different processors, and In order to complete an 
15 AH operation, potentially all the processors must 

examine their portions of the directory against the 
template in order to determine whether a matching 
tuple exists. 

The Linda system makes the programmer's job 
2Q vastly easier; since he need not know the source or 

the destination of his data. The directory is 
automatically consulted and used to match a request 
for a tuple with a tuple that Is available. When a 
required tuple is not available, the requester 
25 waits until such a tuple becomes available. The 

Linda system can be implemented on any of the 
above-mentioned parallel processing architectures, 
most efficiently with global memory, next most 
efficiently with one or more buses, and least 
30 efficiently with links* 

Maintenance of the Linda distributed directory 
id well suited to the bus, because it enables 
notification to be broadcast to potentially inter-, 
ested processors as tuples becomes available. The 
35 notification messages are short, and therefore a 
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alleviated by adding a link crossbar switch, by 
which the configuration of the physical intercon- 
nection pattern can easily be changed at the start 
\ of a program to match the communication pattern 
5 required by the program. However, use of a link 
crossbar switch in this manner does not ease the 
job of defining a suitable connection pattern, nor 
does it facilitate simultaneous execution of two or 
more programs using the same set of processors. 
10 D - Gelernter, Generative Communication in Linda, 

ACM Trans, prog, tang, and Sys., Vol. 7, NO. 1, pages 
80-ll2 r (1985) describes a software system called 
Linda. Linda is based on the use of tuples. A tuple 
is a collection of related data. Elements of a tuple 
15 are fields holding actual values or formals. There 
are two types of tuples, namely passive tuples and 
active tuples. A passive tuple is simply a collec- 
tion of data items, whereas an active tuple is a 
process which becomes a passive tuple. Tuples exist 
20 **i an abstract space called tuple space. The tuple 

space may exist over multiple processors. Four prin- 
cipal operations can be performed on tuple space. 
The out operation is an operation that creates a . 
tuple and places it in tuple space. The in operation 
25 i« the reverse of out ; it specifies a tuple that it 
desires, in the form of a template, and the computer 
matches the template against all the tuples existing 
in tuple space. If a matching tuple is found, it is 
removed from tuple space and is returned to the 
30 requesting process. When no tuple matches, the in 
operation blocks, and the requesting process is 
suspended until another process, through an put ■ 
operation, creates a matching tuple. At this point, 
the requesting process continues. An out operation 
33 can never block. 
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great volume of thorn can be moved over the bus in a 
abort period. However, if a tuple, which might 
contain a large block of data, is moved over the 
bus, updates to the directory may be blocked for 
5 long periods, resulting in poor system performance. 

using links to maintain the Linda directory 
is very inefficient, since broadcasting requires 
repeated passing of the message from processor to 
processor. However, moving data blocks from produ- 

10 eer to requester is an operation to which links are 
well suited. Since there ore many links, many such 
transfers can be taking place simultaneously if the 
right link connections are made. 

Other publications that refer to Linda indlude 

15 D. Gelernter, Programming for Advanced Computing, 
Scientific American, October, 1987, pages 91-9B z 
R. Bjornson, N. Carriers, D. Gelernter and J. 

Leichter, Linda, the Portable Parallel, 

<19BB); N. Carriero and D. Gelernter, Applications 

20 Experience with Linda, (1988); tf. 

Carriero, D. Gelernter and J. Leichter, Distributed 
Data Structures in Linda, Proc. ACM Symp. on Prin- 
ciples of Prog. Lang., pages 236-242 (1986); D. 
Gelernter, N. Carriero, S. Chandrsn and S. Chang, 

25 parallel Programming in Linda, Proc. Int. Conf. on 
Parallel Processing, pages 255-263 (1985); 
N. carrier© and 0* Gelernter, The S/Nefs 
Linda Kernel, ACM Trans. Comp. Sys-, vol. 4, NO. 2 # 
pages 110-129 (1986); N« Carriero, Implementation 

30 of Tuple Space Machines, Yale university, Dept. of 

computer Science, Research Report YALEU/DCS/RR-567 
(1987); N. Carriero and D. Gelernter, Integrating 
Multiple Tuple. Spaces, the File System and process 
Management in a Linda-based Operating System, Yale 

35 university. Dept. of Computer Science, Research 
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Report vALSC/DCS/RR-TechniCel Memo (1988); M. 
Factor, and D. Galernter, The parallel Process 
Lattice as an Organizing System for Realtime 
Knowledge Daemons, , (1989) » 

'3 

Summary of the Invention 
a preferred embodiment of the present inven- 
tion is a computer which comprises a plurality of 
processor modules, each having at least first and 

10 second I/O connection interfaces, a processor con- 
nected to those interfaces, and memory connected to 
the processor. The first I/O connection interface 
of each processor module is connected to a common 
bus. The second i/o connection interface of each 

15 processor module is connected to a switch, the 

switch being operative to connect the second I/O 
connection interface of a selected processor module 
selectively to the second t/O connection interface 
of any other processor module* a controller is 

20 connected to the bus for receiving over the bus 

data pertaining to a first processor module, which 
requires access to information, and to a second 
processor module, from which the required informa- 
tion is available, and for controlling the switch 

25 to allow the required information to be transmitted 
from the first processor module to the second pro- 
cessor module by way of the switch. 

in a computer embodying the present invention, 
the Linda directory can be updated by broadcasting 

30 a template over the bus, and when a processor 
module that has access to a matching tuple is 
identified, the tuple is transmitted from the pro- 
ducing processor to the requesting processor in a 
non-broadcast fashion, by way of the switch. During 

35 the transmission,- the bus is available to process 
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other updates and arrange other connections, so 
that many transmissions can occur simultaneously. 

Brief Description of Drawings 
5 For a better understanding of the invention, 

and to show how the same ™*y be carried into 
effect, reference will now be made, by way of 
example, to the accompanying drawings in which; 

FIG. 1 Is a block diagram of a first networked 
10 computer system embodying the present invention, 

fig. 2 is a more detailed block diagram of one 
of the computer stations shown in fig ♦ 1, 

FIG. 3 is a Simplified block diagram of a 
second networked computer system embodying the 
15 invention, and 

FIG. 4 is a simplified block diagram of a 
stand-alone computer embodying the invention. 

Detailed Description 

20 The computer system illustrated in FIG. 1 

comprises a user terminal 10, several computer 
stations 12 and a disc drive station 14. The user 
terminal 10 comprises a processor 16 which is 
connected to various user utilities, such as a 

25 display card 18, a hard and/or floppy disc drive 

card 20, and a keyboard card 24, through its memory 
bus 26. The memory bus is also connected to local 
random access memory (RAM) 28 and local read only 
memory (ROM) 30. The processor 16 has four link 

30 interfaces 44. A second processor 42 is connected 

over its memory bus to local BAM 48 and has four 
link interfaces 52. In the preferred embodiment of 
the present invention, each processor 16, 42 is an 
Inmos IMS T800 transputer. One link interface of 

35 the processor 16 is connected to a link interface 
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of- the processor 42, and the other three link 
interfaces of each processor are connected to 
respective demultiplexed interfaces of a 6:1 byte 
domain mult iplexer/damult iplexer (mux/demux) 60 , 
5 which has a multiplexed interface connected to a 
fiber optic transceiver 64. The mux/demur 60 and 
. the fiber optic transceiver 64 are used to transmit 
data and instructions between the processors 16 $ 42 
and a fioet optic cable 69, which connects the user 

10 ' terminal 10 to one of the computer stations 12. 

Messages are transmitted over the cable 68 at a 
rate of about 100 Mb/s. 

• The mux/demux .60 has a multiplexer channel and 
a demultiplexer channel. The multiplexer channel 

15 of the mux/demux comprises* for each demultiplexed 
interface, a serial to parallel converter which 
receives serial data over its link in words of 
eight, bits at a rate 10-20 Mb/s and, for each 
eight-bit serial word, generates an eight-bit 

20 parallel word and applies it to a parallel bus with 
a four-bit tag that designates the particular 
demultiplexed interface that provided the serial 
word. Thus, for each eight-bit word received at a 
demultiplexed interface, a twelve-bit word is 

25 applied to the parallel bus. The parallel bus is 
connected to a high* speed parallel-to-serial con- 
verter, which reads the twelve-bit parallel words 
in turn and generates an electrical signal composed 
of a succession of twelve-bit serial words. The 

30 electrical signal provided by the parallel-to- 
serial converter is applied to the fiber optic 
transceiver 64* 

The demultiplexer channel of the mux/demux 60 
comprises a ser ial-to-oar allel converter which 

35 receives twelve-bit serial words from the fiber 
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optic transceiver 64 and generates twelve-bit 
parallel words which are applied to the parallel 
bus. Bach twelve-bit word comprises a four-bit tag 
and an eight-bit data word. Six parallel-to-serial 
3 converters, which ace connected to the demulti- 
plexed interfaces respectively, read the parallel 
bus* The par allel-to~ser lal converter that Is to 
read a particular data word is designated on the 
basis of the four-bit tag associated with the data 

10 word. Thus, in the transmit mode the mux/demux 

receives up to six serial signals and interleaves 
them to provide a single aerial output signal at 
its multiplexed interface. The output signal of 
the mux/demux is applied to the fiber optic trans- 

15 ceiver €4, which launches an optical signal, coded 
in accordance with the 3ignal provided by the 
mux/demux, into the fiber optic cable 68. In the 
receive mode, a coded optical signal is received 
over the fiber optic cable and the fiber optic 

20 transceiver generates a serial electrical signal in 
response thereto. The serial electrical signal is 
applied to the multiplexed interface of the 
mux/demux 60 and the mux/demux demultiplexes it 
into up to six signals which are provided at the 

25 demultiplexed interfaces respectively of the 

mux/demux . 

When data or commands are being input into the 
system, the processors 16, 42 manipulate the data 
Or commands in accordance with programs stored in 

30 the ROM 30 and apply the data or commands in serial 

fashion over the links, the mux/demux 60 and the 
transceiver 64 to the fiber optic cable 68. When 
data or commands are received by the transceiver 
over the fiber optic cable 68, the serial signal is 

35 demultiplexed into up to six serial signals which 
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are applied to the processors 16 r 42 over the 
links, and the processors apply the data or 
commands to the display card 13 or the disc drive 
card. 20. 

5 As shown in PIG. 2, each computer station 12 com- 

prises several processor modules 110, each of which 
, is composed q£ a processor 114 and a random access 
memory 118. In the preferred embodiment of the 
present: invention, the processor 114 Is en Inmos 
10 IMS T800 transputer, comprising a CPU 122, an 

external parallel interface 126 and four link inter- 
faces 130A-130D. The processor 114 also includes 
other components, but these are not relevant to an 
understanding of the present invention and there- 

15 fore are not Illustrated in the drawings. 

The CPCT 122, memory 118, parallel interface 
12* and link, interfaces 130A-133Q of each processor 
114 communicate with each other over an internal 
32-bit parallel bus 134- The parallel interface 

20 126 is connected to an external 32-bit parallel bus 
138, which is connected to the external parallel 
interface of each other processor module, a cross- 
bar switch operator 142 and at least one external 
communication module 144. In the preferred embodi- 

25 ment of the present invention, the crossbar switch 
Operator 142 is an Inmos ins T414 transputer, which 
has essentially the same architecture as the IMS 
T800 transputer. . 

The link interfaces of the processor modules 

30 a£e connected to a programmable crossbar switch 

146, which is implemented by four inmos IMS C004 
programmable link switches 150A-15GD. Each link 
switch has. 32 data link connections. Respective 
data link connections of the link switch 150A, for 

35 example, are connected to the link interfaces 130A 
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of the processors 114. Each link switch also has a 
configuration link connection 158 over which it 
receives a signal for establishing the manner in 
which the data link connections are connected by 
3 the switch. The configuration link connections 15B 

of the four link switches are connected to respec- 
tive link interfaces of the operator 142. 

The crossbar switch 146, the switch operator 142 
and the parallel bus 138 are all carried by a mother 
10 board having sixteen connection slots. Each connec- 
tion slot can receive either a communications card, 
which carries a communication module 144, or a pro- 
cessor card, which carries two processor modules 110* 
Therefore, the maximum number of processor modules 
15 that can be accommodated (it being necessary to have 
at least one communication module) is thiry. The 
four programmable link switches provide an aggregate 
of 128 link connections, and the maximum of thirty 
processor modules occupy 120 of these data link 
20 connections. The other eight link connections are 
connected to the external communication module, for 
purposes which will be described below. If there is 
more than one communication module (and consequently 
fewer than thirty processor modules) , each communica- 
25 tion module is connected to two link connections of 
each link switch. 

Referring again to FIG, 1, the disc drive 
Station 14 comprises a single processor 160 which 
is connected over its memory bus 168 to a high 
30 speed disc drive card 162 and to local RAM and 

local ROM. The processor 160, which may be an 
inmos IMS T800 transputer, has four link interfaces 
which are connected to a 4:1 nfux/demux 164. The 
mux/demux 164 of the disc drive station 14 is 
35 essentially the same as the mux/demux 60 of the 
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user terminal 10, and is connected to a fiber optic 
transceiver 16©. The disc drive station provides 
high speed disc access without being burdened by 
the need to generate a display. 
5 The computer stations 12 shown In FIG. 1 each 

have two communication modules 144 , which enable 
the stations 12 to be connected in a linear 
arrangement between the user terminal and the disc 
drive station. Each computer station therefore can 

10 accommodate up to twenty-eight processor modules 

110* Alternatively, the user terminal 10, the disc 
drive station 14 and the computer stations 12 may 
be connected in a star arrangement, as shown in 
FIG, 3* tn this arrangement, the computer stations 

13 . 12A-12D each need only one communication module, 

but the computer station 12E has six communication 
modules. A third possibility is Ear the computer 
stations to be organized as a hierarchical tree* 
Numerous, other arrangements, .employing di (Cerent 

20 interconnection schemes among the user terminal, 

the disc drive station and the necessary processor 
nodules, are possible* 

Each computer station 12 executes an applica- 
tion by use Of the Linda language. . Data are 

25 received by the computer station by way of an 

external communication module 144 and are stored in 
the external memories lift* The data stored in the 
memories 118 are associated as tuples, and each 
external memory 118 includes a directory portion 

30 containing information regarding the tuples that 

are* stored, in that memory. When the processor 114 
of a processor module Llo executes an out opera- 
tion, the tuple generated in the out operation is 
loaded into the processor module's external memory 

35 and the processor module's portion of the directory 
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is updated to reflect addition of this tuple. When 
a processor module performs an in operation, it 
first examinee its own portion of the directory 
against the template that defines the desired 
5 tuple. If no match is found, the requesting 

processor module may broadcast the template over 
the parallel bus 138 to other processor modules of 
the computer station. The receiving processor 
modules examine their respective portions of the 

10 directory, and the first processor module that 

finds a match places a signal oh the bus to indi- 
cate that the other processor modules should cease 
the search. The requesting and producing processor 
modules then provide signals to the operator 142, 

15 and the operator responds by causing the switch 146 
to establish a connection between a link Interface 
of the requesting processor module and the corres- 
ponding link interface of the producing processor 
module. The matching tuple is then transmitted 

20 from the producing processor module to the 

requesting processor module through the links and 
the crossbar switch 114, and does not occupy the 
bus 138. 

When an ir± or read operation takes place, the 
25 directory portion of the requesting processor 

module is updated to reflect the fact that the 
tuple space of that processor module now contains 
the specified tuple. Similarly, when an j_n opera- 
tion (but not a read operation) is performed, the 
30 directory portion of the producing module is 

updated to reflect the fact that it no longer has 
the specified tuple in its tuple space. When an 
out or eyal operation is performed, the directory 
portion of the module that executes the opera- 
35 tion is updated. It will therefore be seen that it 
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is not. necessary to burden the bus with messages 
pertaining to the contents of the tuple space of 
each processor module* 

Bus operation Has four distinct cycles. A 

. 5 " processor that requires access to the bus 138 in 
\ order to transmit a message asserts a bus request 

signal on a bus control line and an arbitration' 
. cycle takes, place, if no other processor requires 
access to the bus at that time, the first-mentioned 

10 processor wins the arbitration by default. If one 
Or more other processors requires access to the 
bus* distributed arbitration logic ensures fair 
arbitration among the processors that require 
access* When the arrbitration cycle is complete, a 

15 selection cycle takes place. The transmitting 

processor writes a single 32-bit word onto the bus* 
If the computer station has twenty-eight processor 
modules and two communication modules* twenty-eight 
- bits of this word define,, on a one bit per module 

20 basis, a mask of the processor modules that are to 
receive the ensuing message. Two more bits deter- 
mine whether the external communication modules are 
to receive the message. This is the selection 
operation. Thus, with a single bus cycle, the 

25 transmitting processor can select any one or more 
of the other processors to receive its message. 

Each processor that is selected by the trans- 
mitting processor to receive its message receives an 
Interrupt from its parallel interface. The interrupt 

30 forces the receiving processor into a receive message 
' mode, in which each receiving processor reads the 
parallel bus. The transmitting processor receives a 
status bit that indicates whether a receiving processor 
is in the receive message mode, and does not transmit 
. 35 data until all the receiving processors are in the 
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receive message mode. When all the selected pro- 
cessors are in the reeeive message mode, the 
transmitting processor and the receiving processors 
are interlocked and the selection cycle is com- 
5 plete. A transmission cycle then occurs, in which 
the transmitting processor transmits its message 
over the bus in voids of 32 bits. The transmitting 
processor holds its data word on the bus until it 
has received an acknowledgement bit from each 

10 receiving processor* The acknowledgement bit indi- 
cates that a receiving processor has read the data 
word from the bus. The transmitting processor then 
ends its write cyele and each receiving processor 
ends its read cycle. The transmitting processor 

15 then enters another write cycle, in which the next 
data word is transmitted. The first word placed on 
the bus during the transmission phase represents 
the number of words to be transmitted. The 
receiving procesors count the number of words 

20 actually transmitted, and when the number trans- 
mitted is equal to the number represented by the 
first word, they process the message. The trans- 
mitting processor enters a disconnect cycle in 
which it negates its bus request* and this allows 

25 arbitration to take place again. The processors 

that were previously selected are deselected* The 
transmission cycle is then complete* and the bus is 
available for another transmission. 

In the system described, transmissions are 

30 accomplished without use of FIFO buffers. How- 
ever, this is done at the expense of requiring the 
receiving processors to all operate synchronously 
during a transmission, and this might not be 
desirable. Therefore, each processor module could 
35 include a FIFO buffer for receiving and temporarily 
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holding data transmitted to that processor module 
over the bus* In this fashion, the receiving 
processors are able to tun independently of each 
other. Of course* a FIFO buffer has' limited eapa- 
S city, and if one or more of the buffers is filled 
during a transmission, it would be necessary to 
fall back on the previously-described mode of 
transmitting data from a transmitting processor to 
the receiving processors* 

10 Each link Is composed of two lines, allowing 

bi-directional communication between two proces- 
sors. The producing processor transmits words of 
eleven bits serially over the link using one line, 
the first bit, of each word being a start bit and 

13 the last two bits being an ending. Qn receipt of 
the first bit of a word, the requesting processor 
transmits an acknowledgment to the producing pro- 
cessor over the other line of the link. The 
acknowledgment code is a unique two-bit code and is 

20 received by the producing processor before it com- 
pletes transmitting its word. In fact, the length 
of the serial word is such that the acknowledgment 
code can travel through three link switches and 
still be received by -the producing processor, indi- 

25 - eating that the next word can be sent, before the 
transmission of the first word is completed. 
Accordingly, the producing processor can send a 
second word immediately after the first word, 
without having to wait until after the end of the 

30 first word to receive an acknowledgement. 

By' use of the hybrid communication architec- 
ture that has been described and illustrated, the 
computer station shown in FIG, 2 is able to recon- 
figure the pattern of link connections among its 

35 processors dynamically, in response to changes in 
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the communication pattern required by the program 
that is being executed. Also, multiple programs 
can be run simultaneously* without regard to the 
communications patterns required by the programs 
3 respectively, since the pattern of link connec- 
tions is not fixed at the start of a program. 
The programmer need not consider the communica- 
tions pattern that will be required by a program 
that he is writing, since a suitable pattern of 

10 link connections is established automatically. 

The user terminal 10 can run the same programs 
as the computer stations 12* However, when it is 
used as the terminal for a network, the processors 
16 and 42 do not run applications but are concerned 

15 with graphics, creating the display, reading the 
keyboard and accessing the disc. The disc drive 
station 14 is particularly useful if disc access 
cannot be accomplished by the user terminal 10 with 
sufficient speed when it is having to perform other 

20 functions. 

As noted previously, each computer station 12 
includes at least one communication module 144. A3 
shown in FIG. 2, each communication module com- 
prises a processor 170, such as an inmos IMS T800 

25 transputer, having s parallel bus interface and 

four link interfaces- The parallel bus interface 
is connected to the parallel bus 138 and the four 
link interfaces are connected to respective link 
interfaces of a 12:1 mux/demux 172, The other 

30 eight link interfaces of the mux/demur are con- 
nected to eight link connections of the switch 146. 
Except for the number of link interfaces, and con- 
sequently the potential maximum speed of operation, 
the mux/demux 172 of the communication module 

35 operates in essentially the same way as the 
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mux/demux 60 of the user terminal 10. The communi- 
cation modules enable the computer stations to 
exchange tuple requests and tuples. 

When a requesting processor of a computer sta- 
5 tion 12 generates a mask: that defines the modules 
• that are to receive the template defining a desired 
...tuple* the mask will generally include the communi- 
cation module (a) Of that station. The message 
that is transmitted by the requesting processor 

10 . module includes the name of the tuple space that 

would contain the desired tuple, if it exists. The 
communication module stores the names of the tuple 
spaces that are associated with each of the other 
computer stations,, and if any of the other computer 

15 stations with which that communication module can 
communicate by way of its transceiver 174 is asso- 
ciated with the named tuple space, the communica- 
tion module will transmit the template over the 
fiber optic cable. If the message is received by a 

20 computer station that includes a processor that is 
associated with the named tuple space, the proces- 
sor of the receiving communication module directs 
the message over the parallel bus 13S to the 
appropriate processor module(s,.. If the message is 

25 received at one communication module of a proces- 
sor having a second communication module that is 
- connected to a computer .station that includes, a 
processor that is associated with the named tuple 
space, the processor of the receiving comounica- 

30 / tion module directs the message over the parallel 
bus 138 to. the other communication module for 
retransmission. Ultimately, the message reaches 
all computer stations having processors that are 
associated with the named tuple space. 
35 When a matching tuple is found in a computer 
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station other than the one that contains the 
requesting processor! the tuple is transmitted 
through the crossbar switch of the producing compu- 
ter station to a communication module of that sta- 
5 tion and is transmitted serially to a communication 
module of the requesting computer station- This 
might involve the tuple's passing through one or 
»ore intermediate computer stations, and being 
passed between communication nodules of an interme- 

10 diate station through the crossbar switch of that 
station* when the tuple reaches the requesting 
computer station, it is transmitted to the 
requesting processor module by way of the crossbar 
switch. since, as noted previously, e requesting 

15 processor issues its acknowledgment code at the 

beginning of a data word, there is no significant 
performance penalty with respect to transmission of 
tuples from computer station to computer station in 
the described network arrangements. 

20 It will be appreciated that tne present 

invention is not restricted to the particular 
embodiment that has been described and illustrated, 
and that variations may be made therein without 
departing from the scope of the invention as 

25 defined in the appended claims and equivalents 

thereof* For example, the invention is not 
restricted to use with any particular type of 
processor. It is necessary only that the processor 
be aole to support two communication modes, one 

30 of which is suitable for transmission of data 

pertaining to information required by the processor 
and the other of which is suitable for transmission 
of tne information itself. The invention is not 
restricted to use with computers configured for 

35 connection in a network. If the FIG* 1 computet 
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was designed for stand-alone use* it could 
accommodate up to thirty-two processor modules* 
Farther r the invention is not restricted to use of 
a crossbar switch to interconnect the link 
5 interfaces of the processors. Five processor 

modules each having four link interfaces could each 
have one link interface hard viced to a link 
interface of each other processor module, as shown 
in FIG* 4m I* this case, a producing processor is 
10 always able to transmit a tuple to a requesting 
processor by way of the link Interfaces by which 
they are connected. 
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Claims 

1. A computer which comprises a plurality of 
processor modules, each having at least first and 
second I/O connection interfaces, a processor con- 
nected to those interfaces, and read-write memory, 

5 a common bus to which the first I/O connection 

interface of each processor module is connected » 
switch means connected to the second I/O connection 
interface of each processor module and operative to 
connect the second I/O connection interface of a 

10 selected processor module selectively to the second 
i/o connection interface of another processor 
module, and a switch operator connected to the bus 
for receiving over the bus data pertaining to a 
first processor module, which requires access to 

15 information, and to a second processor module, from 
which the required information is available, and 
for controlling the switch means to allow the 
required information to be transmitted from the 
first processor module to the second processor 

20 module by way of the switch means, 

2. A computer according to claim 1, wherein 
the common bus is a parallel bus and the first I/O 
connection interface of each processor module is a 

25 parallel connection interface. 

3. A computer according to claim 1, wherein 
the second I/O connection interface of each proces- 
sor module is a link interface* 

30 



35 
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4, A computer according to c la In 1, further 
comprising an external communication module which 
includes an external communication interface and is 
connected to the bus and to the switch means and is 
5 operative to transmit data between the external 

communication interface and the processor modules 
;. by way of the switch means or the bus, 

5- A computer accord inn to claim 4# wherein 
10 . the external communication interface is a fiber 

optic transceiver* 

6- A computer system which comprises at least 
first and second computer station a each comprising: 

25 (a> a plurality of processor modules, 

each having at least first and second I/O connec- 
tion interfaces « a processor connected to those 
interfaces, and read-write memory; 

(bj a common bus to which the first I/O 

20 connection interface of each processor module is 

connected; 

(o) switch means having a plurality 
connection interfaces connected to the second I/O 
connection interfaces of the plurality of processor 

25 ' modules respectively and also having at least one 
additional connection interface, the switch means 
being operative to connect any one of its connec- 
tion interfaces to any of its other connection 
interfaces; 

30 (d) a switch operator connected to the 

bus for receiving over the bus data pertaining to a 
first processor module, which requires access to 
information, and to a second processor module, from 
which the required information is available, and 

35 for controlling the switch means to allow the 
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required information to be transmitted from the 
first processor module to the second processor 
module by way of the switch means; and 

(e) at least one external communication 
5 module which includes an external communication 
interface and is connected to the bus and to the 
switch means and is operative to transmit data 
between the external communication interface and 
the processor modules by way of the switch means ox 
10 the bus. 

7. A computer system according to claim 6« 
wherein the external communication interface of an 
external communication module of the first computer 

15 station is connected to the external communication 

interface of an external communication module of 
the second computer station* 

8. A computer system according to claim 6, 
20 wherein the external communication interface is a 

fiber optic transceiver. 

9. A computer system according to claim 6# 
wherein the common bus of each computer is a 

25 parallel bus and the first I/O connection interface 
of each processor module is a parallel connection 
interface. 

10. A computer system according to claim 6* 
30 wherein the second I/O connection interface of each 

processor module is a link interface and the 
additional connection interface of the switch means 
of each computer is a link interface. 

35 
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11* A computiE System according to claim 6, 
further comprising a user terminal having display 
moans, data input means* at least one processor 
connected t^o the display means and data input 
5 means and having at least one 1/0 connection 
interface* end external communication means 
r ; " connected to the I/O connection interface and 

having an external communication interface which is 
connected to an external communication interface 
10 of at least one computer station for transmitting 
data between 'said one computer station and the 
processor of the user terminal. 

X2. A method. Of operating a computer com- 
15 prising a plurality of processors each having at 
least first and second I/O connection Interfaces 
and read-write memory « the method comprising; 

(a) in the event that a first of said 
processors requires access to information that is 

20 . not present in its memory, transmitting a request 
for information to at least a second of said pro- 
cessors by way of the first I/O connection inter- 
faces of the first and second processors, and 

(b) in the event that the required infor- 
25 matron is contained in the memory of the second 

processor, transmitting the required information 
from the second processor to the first processor by 
way of the second I/O connection interfaces of the 
first and second processors. 



35 



PAGE 32/37 1 RCVD AT 1 1/10/2004 8: 31 :01 PM [Eastern Standard Time] * SVR:USPTO-EFXRF-1/0 * DNIS:8729306 * CSID:8064986673 * DURATION (mm-ss): 12*12 



11-1 1-J04 09:41 ?&- WINSTON HSU, NO. 41526 8064986673 T-167 P33/37 U- 

WO 89/09967 PCT/US89/0I456 

25 

13- A method of operating a compter, system 
which comprises at least first and second computer 
stations each comprising a plurality of processors 
each having at least first and second I/O connect 
' s tion interfaces and read-write memory* and each 

computer station also comprising external communi- 
cation means, the method comprising! 

(a) in the event that one processor of the 
first computer station requires access to informa- 

10 tion that is not present in the memory of any of 
the processors of the first computer station, 
transmitting a request for information to at least 
tha second computer station by way of the first I/O 
connection interface of said one processor and the 

15 external communication means of the first and 
second computer stations, and 

(b) in the event that the required infor- 
mation is contained in the memory of a processor of 
the second computer station, transmitting the 

20 required information from that processor to said 
one processor by way of the second I/O connection 
interface of the second processor, the external 
communication means of the second and first compu- 
ter stations, and the second I/O connector inter- 

25 face of the first processor. 



30 



35 



PAGE 33/37 * RCVD AT 1 111012004 8:31 :Q1 PM [Eastern Standard Time] * SVR:USPTO-EFXRF-1/0 * ONIS:8729306 * CSID:8064986673 ' DURATION (mm-s$):1M2 



11-11- '.04 09:41 WINSTON HSU, NO. 41526 8064986673 



T-167 P34/37 U- 



WO 89/69967 



18 



DISPLAY 



^20 

KEYBOARD "| — » 



DISC DRIVE 



24 

H 



30' — ^ 



28 



ROM 



26^ 



ZL 



48 



RAM 



ZL 



PCIYUS89/01456 



RAM 



60 



44< 



PROCESSOR 

r 



16 



42. 



^4/ 



PROCESSOR 



52 



P 



MUX/DEMUX TRANSCEIVER 



110 



PROCESSOR 
MODULES 



COMM 
MODULE 



- 144 



64 



52 
10 



COMM 
MODULE 



144 



110 



2l 



144* 



PROCESSOR 
MODULES 



COMM 
MODULE 



P. 

144 



12 



.12 



COMM 
MODULE 



14" 



DISC drive! 



ROM 



RAM 



FIG.l 



66 



160_ 
164 



PROCESSOR I 













166 



———31 FIBER OPTIC 

MUX/DEMUX[ «-» TRANSCEIVER 



PAGE 34^7 * RCVD AT 11/1012004 8:31:01 PM [Eastern Standard Time]* SVR:USPTO-EFXRF-1/0 * DNIS:8729306 * CSID:8064986673 * DURATION (mm-ss):12-12 



II-H-'jM 09:41 t£- WINSTON HSU, NO. 41526 8064956673 

l\ '«! » ft 



T-167 P35/37 U- 



WO 89/09967 



2/3 



PCT/US89/01456 




LINK 
SWITCH! 




LINK 
SWITCH 



FIG. 2 





» PROCESSOR 


MODULE 




• 






► PROCESSOR 


MODULE 


» i- 





COMMUNICATION MODULE 







PROCESSOR J; 






A 




^-170 


MUX/DEMUX M7 





-174 



FIBER OPTIC 
TRANSCEIVER 



COMMUNICATION MODULE 



y 



142 ^i CROSSBAR SWITCH 

OPERATOR 



SUBSTITUTE El'zEZT 



PAGE 35/37 1 RCVD AT 11110/2004 8:31 ;01 PM [Eastern Standard Time] * SVR:USPTO-EFXRF-1/0 * DNIS:8729306 * CSID:8064986673 • DURATION Jnm-ss):1M2 



11-11^04 09:42 ffi- WINSTON HSU, NO. 41526 8064986673 



T-167 P36/37 U-308 



WO 89/09967 



PCI7US89/01456; 



3/3 



a 



UjZ 



fl 

8 



=»p 



n 

8 



8& 



=»P 
8" 



c 



r 



s ° 
sp 



r 



SB 



Q_ 



< 
O 
CD 

>- 



CO 

d 



r 



g-\ s-^ a-^ 



UJ 

a 

O 

o 



Is 

si 



Of 

of 



«0- 



§1 



if 



to 5 

88 



»— i 



substitute sheet 



PAGE 3607 * RCVD AT 1 1/1 0/2004 8:31:01 PM [Eastern Standard Tine] ' SVR:LSPTO^FXRF-1/0 ' DN1S:8729306 « CSID:80649B6673 * DURATION (mm-ss):12-12 



09:42 tg- WINSTON HSU, NO. 41526 8064986673 



T-167 P37/37 U 



INTERNATIONAL SEARCH REPORT 

International AcglicaUcn Nq, PCT /OS 8 9 / 0 J 4 5 6 



L CLASSIFICATION Of SUBJECT MATTtW (H s 



According la Inferiwtwnaf Patent CteaelAcafJoo (lf»Oof lo MM National Clarification and tPC 

IPC(4): G06F 13/38 G06F 13/42, C06F 15/56 
fltfit qi, 36t/?POi 370/60 



Minimum Documentation 8»«rcr»o ? 



Classification Symbols 



17,8, 



364/200 MS FILE. 364/900 MS FILE • 340/825.52 
370/60, 370/67 



to the E*ter4 the! Aoeh Ooci 



i are Indue** l» the FtoUs Scorched « 



r. of the rw torso} p 



V, P 

A 
A 
A 

A. P 

A 

-A * * 



US. 

vs. 

us, 

us, 
US. 

s;s, 

US. 



A, 4,627.045 
See figs. 3 , 2 
col. 7, lines 

A, 4.811,210 
See figs. 'J, 4, 
col. 30, lines 

A« 4 ,720,780 

A, 4, 644, 496 

A,4,417,334 

A, 4, 794, 59*] 

4,4,633,473 

A, 4, 748,560 



l Olson *t al) 02 Dec., 1986 
. col. 7, lines 1 - £, 
IB - 63, col. 8, lines 19 - 2: 



IMcAulayl 07 Mar. , 1989 
5A, 26, col. 8, Lines 14 
16 - 22. 



34. 



(Dolecek) 19 Jan., 1988 

( Andrew-} 17 Feb. , 1987 

{Guivierson et all 22 Nov., 1983 

<Picard» 27 Dec., 1988 

{ Ratchf ord et <U.) 30 Dec., 1986 

(Kataokal 31 May, 1988 



• Special ceiaeorie* of <M*i Cocofnerrta: * 

-A - eocuraarw defining the notvaral slate of tha art which I* not 

conaktemd to to* 0* particular relevance 
T earlier oecamam but published oo or attar tl»a |pte»nattonel ' 

Bane data 

«*L" document which M( throw dootrta on prior** Cl*«m(s) or 
nnicft I* ened to eatabllah the publication date of air— — 



"T* later document ouhliahed afltr tha Inter natipnai finn<j ««• 
or priorhy dale and not In conflM with the application but 
cued to understand the orinctpla or theory undadytno- the 



-X- document of particular relevance; thft claimed Irw.njtan 
cannot ba coneiderecS n**«l or unnol be considered 10 



"0- document referrino to en oral diaefteeure. aea. **tribia<m or 
at tier mean* 

**" ttwttan thS b priO^, P d 



-Y- document of particular tha claimed 



"4" document member of the sam* patent family 



IV. CCRTIFlCATlOM 



Date of the Actual Completion of U 

12 JUNE 1989 



Data of Walling of thU In tarnation al Search Report 

2 7 JUL 1989 



Imanwttonal Searching Authority 



SiQjiature of Autborited 

THOMAS 



theritad Officer/^) 



90teMi47) 



PAGE 37/37 * RCVD AT 1 1/10/2004 8:31 :01 PM [Eastern Standard Time] * SVMSPTO-EFXRM/O * DNIS:8729306 * CSID:8064986673 ' DURATION (mm.ss):1M2 



